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RAW SEQUENCE LISTING DATE: 11/19/2004 

PATENT APPLICATION: US/10/621 , 911A TIME: 10:45:21 

Input Set : A:\PTO.FG.txt 

Output Set: N:\CRF4\11192004\J62191lA.raw 

3 <110> APPLICANT: SAITOU, Mitinori 

4 SURANI, Azim 

6 <120> TITLE OF INVENTION: Genes 

8 <130> FILE REFERENCE: 674558-2002 

10 <140> CURRENT APPLICATION NUMBER: 10/621, 911A 

11 <141> CURRENT FILING DATE: 2003-07-17 

13 <150> PRIOR APPLICATION NUMBER: PCT/GB02/00215 

14 <151> PRIOR FILING DATE: 2002-01-18 

16 <150> PRIOR APPLICATION NUMBER: GB 0101300.2 

17 <151> PRIOR FILING DATE: 2001-01-18 
.19 <160> NUMBER OF SEQ ID NOS : 26 
21 <170> SOFTWARE : SeqWin99, version 1.02 

23 <210> SEQ ID NO: 1 

24 <211> LENGTH: 617 

25 <212> TYPE: DNA 

26 <2'13> ORGANISM: Mus musculus 

28 <400> SEQUENCE: 1 . 

29 gccgcagaaa gggcagaccc gcagcgcgct ccatcctttg ccctccagtg ctgcctttgc 60 
3 0 tccgcaccat gaaccacact tctcaagcct tcatcaccgc tgccagtgga ggacagcccc 12 0 

31 caaactacga aagaatcaag gaagaatatg aggtggctga gatgggggca ccgcacggat 180 

32 cggcttctgt cagaactact gtgatcaaca tgcccagaga ggtgtcggtg cctgaccatg 240 

33 tggtctggtc cctgttcaat acactcttca tgaacttctg ctgcctgggc ttcatagcct 300 

34 atgcctactc cgtgaagtct agggatcgga agatggtggg tgatgtgact ggagcccagg 360 

35 cctacgcctc cactgctaag tgcctgaaca tcagcacctt ggtcctcagc atcctgatgg 42 0 

36 ttgttatcac cattgttagt gtcatcatca ttgttcttaa cgctcaaaac cttcacactt 480 

37 aatagaggat tccgacttcc ggtcctgaag tgcttcaccc tccgcagctg cgtccctcct 540 

38 tgcccctccc tacacgcagg tgtaacactc atttatctat ccacagtgga ttcaataaag 600 

39 tgcacttgat aaccacc 617 

41 <210> SEQ ID NO: 2 

42 <2 11 > LENGTH: 137 

43 <212> TYPE: PRT 

44 <213> ORGANISM: Mus musculus 

46 <400> SEQUENCE: 2 

47 Met Asn His Thr Ser Gin Ala Phe lie Thr Ala Ala Ser Gly Gly Gin 

48 1 5 10 15 

50 Pro Pro Asn Tyr Glu Arg lie Lys Glu Glu Tyr Glu Val Ala Glu Met 

51 20 25 30 

53 Gly Ala Pro His Gly Ser Ala Ser Val Arg Thr Thr Val lie Asn Met 

54 35 , 40 45 

56 Pro Arg Glu Val Ser Val Pro Asp His Val Val Trp Ser Leu Phe Asn 

57 50 55 60 

59 Thr Leu Phe Met Asn Phe Cys Cys Leu Gly Phe lie Ala Tyr Ala Tyr 

60 65 . 70 75 80 
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62 Ser Val Lys Ser Arg Asp Arg Lys Met Val Gly Asp Val Thr Gly Ala 

63 85 90 95 

65 Gin Ala Tyr Ala Ser Thr Ala Lys Cys Leu Asn lie Ser Thr Leu Val 

66 100 105 110 

68 Leu Ser He Leu Met Val Val He Thr lie Val Ser Val He He He 

69 115 , 120 125 

71 Val Leu Asn Ala Gin Asn Leu His Thr 

72 130 . 135 

74 <210> SEQ ID NO : 3 

75 <211> LENGTH: 823 

76 <212> TYPE: DNA 

77 <213> ORGANISM: Mus musculus 

79 <400> SEQUENCE: 3 

80 ggatcacaga ctgactgcta attgggtctt ggttttaggt cttttcaaag actaagcaat 60 

81 cttgttccga gctagctttt gaggcttctg cccatcgcat cgccatggag gaaccatcag 12 0 

82 agaaagtcga cccaatgaag gaccctgaaa ctcctcagaa gaaagatgaa gaggacgctt 180 
83. tggatgatac agacgtccta caaccagaaa cactagtaaa ggtcatgaaa aagctaaccc 240 

84 taaaccccgg tgtcaagcgg tccgcacgcc ggcgcagtet acggaaccgc attgcagccg 300 

85 tacctgtgga gaacaagagt gaaaaaatcc ggagggaagt tcaaagcgcc tttcccaaga 3 60 

86 gaagggtccg cactttgttg tcggtgctga aagaccctat agcaaagatg agaagacttg 42 0 

87 ttcggattga gcagagacaa aaaaggctcg aaggaaatga gtttgaacgg gacagtgagc 480 

88 cattcagatg tctctgcact ttctgccatt atcaaagatg ggatccctct gagaatgcga 540 

89 aaatcgggaa gaattaggag cttacattgt acgctgccct. ggctgtcgac gatgccgcac 600 

90 agcagatgtg aaagctattt tttgtttaag attaaacttt ttctggtgct gggaaatctt 660 

91 aacttgttaa cctttaaatt gtagatagga tgcacaacga tccagattta tgtgaagttt 72 0 

92 agaagcctca agctgtgagg cccagggctg aggaataaag taaatagaat ttggagtatg 780 

93 tacgttctaa tttccagaaa tttgtaataa aagcattttt gtt 823 
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Output Set: N:\CRF4\11192004\J621911A.raw 

126 130 135 140 

128 Ala Lys lie Gly Lys Asn 

129 145 150 ' 

131 <210> SEQ ID NO: 5 

132 <211> LENGTH: 4925 

133 <212> TYPE: DNA 

134 <213> ORGANISM: Rattus sp 

136 <400> SEQUENCE: 5 ' ' 

137 cgcccgcccc cccccccccc cctccccccc cccccacctc cgacgtatga tggctcctag 60 
13 8 acgcaacacg aagcggactc cccgcatcat tcacgtagac ccgccttctg ctttccctgt 12 0 

139 cggggttttg ggaagcccgg cggccctctc ttctcacctt gctccactag cacgcggctg 180 

140 ttttcactga gcccagcact ggctaagtgg agcaccagga gtttcaggct atccttcaga 240 

141 gggcaaggtg tagtccatgg tgggctacag gagaccctct ctctccgtga gtacagagag 300 

142 gcaaacccaa gccagacagg ggtgatgatt aggaacatac cttcgtcggg gagaaaatac 360 

143 cggttcatat aggaataaga ggaaccagga ggtagttaag gctgtggtgt ctggttgcgg 42 0 

144 ggtttttgac tctcaacaac cacgttcaga acgtgctgag tttttatgat ggtgtagaat 480 

145 ttccttatca gcaattggtc tccgcggtgt ttctttttct tttttaattt tttaagtata 540 

146 atttggtgtt tgaagcaact gtacttggac tagaactccc tgtgtaatcc agaatggaat 600 

147 cccaaatcct aggattaaag gttttagtgg gctgcagtgt tgggtggggg ttgttttgat 660 

148 tacgttgtag cccaggctgg gctcaatctc aatcctcctg cctctgcctt ctaaacgcta 720 

149 ggattaaaag tgctgcgcca tgatcctgct gtagctttat ttttatttat ttatttattt 780 

150 attttggctc tttttttttg gagctgggga ccgaaccgag ggccttgtgc ttcctaggca 840 

151 agcgctctac cactgagcta aatccccaac cccagtgtag ctttattttt aagaacagga 900 

152 gtcttgtttc tcaaaacagt ttctctgtag ccctggttgt cctggaactc cgtaaaccag 960 

153 gctggtttgg gactctgcct ttaaaacact gggactaaag gcggtaccac ctccgtgggc 102 0 

154 tacaccggaa tcttttaagc ttcatttgaa ccggggcttt ttctttttct cacccacttt 1080 

155 c t ggaagcga ttttcctgct aaatttccat tcctggtaaa tgactctgag gggaaatagg 1140 

156 aacccagaat agattgagcc gggggctacc tgggaccccg cactccccac cccccagccg 1200 

157 ctgttgaagc tctttgcctg aggggcctcc gggtttgata cctcctagca ctccgggctg 1260 

158 agggcgtggc tcgggaggag ccattccttt ggagaggaaa acaactgctg gccttgaatc 1320 

159 tgccctaata cctgacagtt acatgggacc tccttatttc cacaggattc tttagtcttt 1380 

160 gtttgggaga ttttcaaatc ttgagactgc tcaacccttc ctggcctaac actcacaagg 1440 

161 ccaggctaga cccaaattct gtcaacccct tctgtgtcca aaacggtggg tggctagctg 1500 

162 gctcaccctt ggtgtcactt tgctttaaca ttcggaaaag ttgtggtaag tttcctgtat 1560 

163 aaaataggac catctactgg gtgtggtccc atgtaaagca aggttggttt cccaaaatac 1620 

164 cctgtttaca tagatgtccg gaagcattgg agcaggtcaa ttagatttag gtggaaacag 1680 

165 cctgtttttg gaaagctttc cagggcggaa aatgaaccca gaggcactat tgggcaagcc 1740 

166 ctccggctaa gcaacacaat tggctgcagg ggtctctgga agaggtgtga gacaagagag 1800 

167 aatatgcagg tttcaggacc tctgaactag agttaggctg ctgtaacatt gtaacattgc 1860 

168 tgtaagcaga acagcccatg gtaagaagct cagtggatct ctacaaacac taggatatct 1920 

169 gctcagggtt tatgaccagg ccctgtgcat atggtttgct tcttgttggc ccctctcttg 1980 

170 aagaggggtg attatctgtt acccacttcc ttgtttctct ggggtattac cttgcaaaat 2040 

171 gcaaaatgat atacttcact aatgtctcca tcttctgttt cagaaatcct acaaccagaa 2100 

172 acactagtaa aggtcatgaa aaagctaacc ctgaacccca gtgccaacjcc gacaaaatat 2160 

173 catcgtcgtc aaagggttcg tctccaggtt aagagccagc ctgtggagaa cagaagtgaa 2220 

174 agaatcatga gggaagttca aagcgccttt cccaggagaa gggtccgcac tctgttgtcc 2280 

175 gtgctgaaag accccatagc aaggatgaga agatttgttc gggtgagttg cgtttgtggg 2340 

176 cggggcatag atctaagagc aactctagcc tcaggaatgg cacctaggtt aaacagggaa 2400 

177 tgtagacaag gatagtgact acctgtgatt cccagctcaa gaaaacaagc tccaaggcta 2460 
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178 tcctctactg cgcagtctga agctggccag agctatatgc aaattgataa gtcagtataa 2520 

179 catttatttt tggattttca gactccctcc ccatagtcca aactggccct ccagttcagt 2580 

180 ccacggtcct gcttcttccc cggtgctagg cttttgagtg ataaggctga cttagactgg 2640 

181 atctcagagc tgaagtggac ctgttagtct ttgtagacca ggctggggtg gtttctgctt 2700 
182. tctcagcgcc tagctcacat agtaggcatt ttaactttgt cttaatagta atttgagtaa 2 760 

183 ttttgttttt ctcttgaaga ttgagcagag acaaagacag cttgaaggaa atgaggtaaa 2 82 0 

184 tgcatatgga tgggtagggt gtctatggat gggtagggtg tcttgttttt actgtttcct 2880 

185 tagacaagga gtgtgtatgt ggagagttac cttctcaaca cagggaatct ggttattaaa 2 940 

186 gcagtacttt aaaaataaat aaaataaata aaataaaaat aaagcagtag aaggggattt 3000 

187 acatttcttt tgagttgcaa tatcctgatt aacatttttc tttcagagac gagatgagcc 3060 

188 attcagatgt ctctgcactt tctgccatta tcagagatgg gatccttctg^ agaatgctaa 3120 

189 aatcgggcag aaccagaaga attagggcag tttgaattgt acaccgtcct tgccgttaac 3180 

190 ggtgccatgc agcagatgtg aaagctgttt ttttgtttaa gattaaactt ttcttggtgc 3240 

191 tggggaaatc tcttctaatt gctaaccttt aaattatata ggatgtgtga catttggatt 3300 

192 catgggaatg acagatttac ccaagaattg agcatgagtc aaagcctggt agtttgattt 3360, 

193 agaaggtaat tggaataaat ctttttattt tagattttct agtttgcaga gaaatttgta 3420 
-194 . ataaaggcaa atttgttatc tttaataaat acagaacaga ttagaatgag ccattggaga 3480 

195 tgggggactc gttttttaca ggtgcatgtg tgggtgtgtg atgttcagag ttcaatgtgt 3540 

196 gctaccctgt atttctgctt gaggcaaggt ctccatgagg cctagctggt ctaactcctg 3600 

197 gtcctgcctt ttgttttccc ctgagttttg acaccatagg cttgtcggca agatctggaa 3660 

198 gaggcttgat gtttgtgttt gtgctgtgta ataaacaatt ggttgacata ttcctaaagt 3720 

199 gtggcactgt attgacctgt ctgtctcatg aggaagttaa tgaccggagc ataattgtat 3780 

200 gctttatttc ctgagagaag tgtcaggaaa ggaggagtta ggaagaaagc cccaggctgg 3840 

201 ggttaagagc actggctgct tttccagagg tcctgagttc aattcccagc aatcacctgg 3 900 

202 tggctcccga acatctgtaa caggatccaa tgccctcttt tggtgtgtct aagaactccc 3960 

203 taggcatgca gaggattttt gtttttgttt tttttttttt tttttttttt ttcgtttttt 4020 

204 tcagagctgg 3 ggaaccgaac ccagggcctt gcgcttgcta agcaagcgct ctaccactga 4080 

205 gctaaatccc caacccctac aatggccttt ttctacctgc ttttgaatta tcaataaaag 4140 

206 actggggcaa aagaaaggct ggagtgaatg agagagaaca tgtgaagagt aaatgagaga 4200 
2 07 gagcatgagg gaatgaatga gagagtgaat gtgagaacga atgtgagagc gagtgagaga 42 60 
208 aeatgagaag aacacgttaa gagtgagtga agagagaatg tgaggtgtgt atgaagattg 4320 
20 9 tgtgtggggt tggggattta gctcagtggt agagtgcfetg cctaggaagc acaaggccct 4380 

210 gggttcggtc cccagctcca aaaaaaagac ccaaaaaaaa aaaaaaaaaa aaagattgtg 4440 

211 tgtgtgtgtg aaaggagagt gcatgtggtg tgtgtgagat atgtgcaagg tgtgtatcaa 4 500 

212 gagtgtgtgt gagagtgaaa gggtaatgaa cagaggtgtg catgagcgtg ggagtttgag 4560 

213 aaaagaaaac agcaataaaa aaaaaagcag agtgcacgag agaatgcaga gtgtgtgcaa 462 0 

214 cctcaagctg agacagagac agagagaaag agagagagag agagagactt taagccttga 4680 

215 aattacctgt cagtttgtac ccaaatagta gtctgtgtat atttattttg agecttccag 4740 

216 atccctgctt ccagtggaga actctgattc tatgttgagg ctggaccctg gcaatagtgg 4800 

217 gcttcttgaa aaatagtcaa aggaaacagt gctacaccat ggacttaagc ctttagactc 4860 

218 agttctggct tcaagagcag ctgtcagaaa ataagtgatg aactacttgc agtcgaactc 4920 

219 gaatc 4925 

221 <210> SEQ ID NO: 6 

222 <211> LENGTH: 1444 

223 <212> TYPE: DNA 

224 <213> ORGANISM: RattUS sp 
226 <400> SEQUENCE: 6 

227 • ccaggattca gacgagctag gcctcatgca tggagacctt gcctcaagca gaaataaaca 60 

228 gggtagcaca cattgaactc tgaacatcac gagtgtgcac acacccacac atgcatctgt 12 0 
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229 aaaaaacgag tccccatctc caatggctcg ttctaatctg ttctgtgtat ttattaaaga 180 

230 taacaaattt gcctctatta caaatttctc tgcaaactag aaaatctaaa ataaaagatc 240 

231 tattccaatt accttctaaa tcaaactacc gggctttgac tcatgctcaa ttcttgggta 300 

232 aatctgtcat tcccatgaat ccaaatgtca cacatcctat ataatttaaa ggttagcaag 360 

233 tagagatttc cccagcacca agaaaagttt aatcttaaac aaaaaaacag ctttcacatc 420 

234 tgctgcatgg caccgttaac ggcaaggaca gtgtatgatt caaactgccc taattcttct 480 

235 ggttctgccc aattttagca ttctcagaag gatcccatct ctgataatgg cagaaagtac 540 

236 agagacatct gaatggctca actcttctct catttccttc aagctgtctt tgtctctgct 600 

237 caatccgaac aaatcttctc atccttgcta tggggtcttt cagcaccgac aacagtgtgc 660 

238 ggacccttct cttgggaaag gcgctttgaa ctcccctcat gattctttca cttctgttct 720 

239 ccacaggctg gctcttaatc tggagacgaa ccctttgacg aagatgatat tttggccgat 780 

240 tgagatagaa tatcaaaaca acatttaaca tttaaataac ttaacgatat acacaccttt 840 

241 tttttttcca cctccccaca cagacaaaaa acaaccctat tttttcttta caaccccgcc 900 

242 taagcaagcg aagcattagt aactgaccaa tcatagaaag gaaacaGcac cagaccacat 960 

243 caaataaaat aaaatcaccg cccaacccca cccctataaa aaacccgccg accacaccac 102 0 

244 atatactccc ccccccccgc accatcacta catcaccctc tccacccatt cccacctccc 1080 

245 cccccaacat taaccccacc ccatcacgga aacccccaac accaacaaat aaattagaca 1140 
•246 catcgcatta cataaattga cacaagaccc accccaaaag agcagcaaag attagagcca 12 00 

247 catcctcggc ccaacacaat acactcaacc tgcatagtat ctatctccac cccaacctag 1260 

248 aaacaaaaat ctaatcagca ccaggcaccc aagtatcacg cacactcaaa aacataccca 1320 

249 ccaattaaac acgccccacc cacccaacaa cccacccgcc tgacaacaca cttcggaact 1380 

250 accctcaaca tcaccaaaag caatcgcaag ttacgatgac tccaaccacc tcactctctc 1440 

251 attg 1444 

253 <210> SEQ ID NO: 7 

254 <211> LENGTH: 7656 

255 <212> TYPE: DNA ' 

256 <2 13 > ORGANISM : Rattiis sp " ^ " r " 

258 <220> FEATURE: 

259 <221> NAME /KEY: misc_f eature S 

260 <222> LOCATION: (7471) (7471) 

261 <223> OTHER INFORMATION: "n" is an unknown nucleotide 

263 <220> FEATURE: 

264 <221> NAME /KEY : misc_f eature S 

265 <222> LOCATION: (7554) (7554 ) / 

266 <223> OTHER INFORMATION: "n" is an unknown nucleotide 

268 <220> FEATURE: 

269 <221> NAME/KEY: misc_f eature / 

270 <222> LOCATION: (7608 )..( 7608 ) / 

271 <223> OTHER INFORMATION: "n" is an unknown nucleotide 

273 <400> SEQUENCE: 7 

274 ctgcaagtag ttcateattt acagatcaaa agaaagaaga ataaaaaaac aaggtgtcat 60 
2 75 gatccctcca aaagagtgga acacttcaac tgccagatcc aagatactga aatgggtagc 12 0 

276 atgctggaga aagaattcaa aagttaggta gagaatctgg ttgagcagag cacttgcttt 180 

277 tcttccagag gatctgagtt caagtcccag gacctatatc acagttttct gtaactctag 240 
2 78 ctccagaggg tctgacactt ctgttcactg tgggcacctg cattcacaga caaacataaa 3 00 

279 gtagttcatc acccttttca cagaaaaccc acagcatgtg aggaaatccg ggtctctgcg 360 

280 caatgccccc acagcagaag gggggagctg gagagatggt tcatctgtta gcccatttat 420 
,281 tgctcttgaa gagaacccag ggtcatccat agcacccata gcagctcaca accatctcca 480 . 
282 gttccaggag atccaatgcc ctgttgtgac ctcaggtacc aggcatacac aatgaacctg 540 
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RAW SEQUENCE LISTING ERROR SUMMARY DATE: 11/19/2004 

PATENT APPLICATION : US/10/621 , 911A TIME: 10:45:22 

Input Set : A:\PTO.FG.txt 

Output Set: N:\CRF4\11192004\J621911A.raw 



Please Note: 



Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of' each/ sequence which presents at least one n or Xaa. 

Seq# :7; N Pos . 7471, 7554, 760(8 
Seq#:8; N Pos. 2115,2142,2143,2146 
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VERIFICATION SUMMARY DATE: 11/19/2004 

PATENT APPLICATION.: US/10/621 , 911A TIME: 10:45:22 

Input Set : A:\PTO.FG.txt 

Output Set: N:\CRF4\11192004\J621911A.raw 

L:398 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7 after pos.:7440 
M:341 Repeated in SeqNo=7 

L:464 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8 after pos.:2100 
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